Appl Bioinformatics 2005; 4 (1): 13-24

نویسندگان

  • Terry Clark
  • Josef Jurek
  • Gregory Kettler
  • Daphne Preuss
چکیده

Data management systems are fast becoming required components in many biology laboratories as the role of Abstract computer-based information grows. Although the need for data management systems is on the rise, their inherent complexities can deter the full and routine use of their computational capabilities. The significant undertaking to implement a capable production system can be reduced in part by adapting an established data management system. In such a way, we are leveraging the Genomics Unified Schema (GUS) developed at the Computational Biology and Informatics Laboratory at the University of Pennsylvania as a foundation for managing and analysing DNA sequence data in centromere research projects around Arabidopsis thaliana and related species. Because GUS provides a core schema that includes support for genome sequences, mRNA and its expression, and annotated chromosomes, it is ideal for synthesising a variety of parameters to analyse these repetitive and highly dynamic portions of the genome. Despite this, production-strength data management frameworks are complex, requiring dedicated efforts to adapt and maintain. The work reported in this article addresses one component of such an effort, namely the pivotal task of marshalling data from various sources into GUS. In order to harness GUS for our project, and motivated by efficiency needs, we developed a structured framework for transferring data into GUS from outside sources. This technology is embodied in a GUS object-layer processor, XMLGUS. XMLGUS facilitates incorporating data into GUS by (i) formulating an XML interface that includes relational database key constraint definitions, (ii) regularising traversal through that XML, (iii) realising automatic processing of the XML with database key constraints and (iv) allowing for special processing of input data within the framework for automated processing. The application of XMLGUS to production pipeline processing for a sequencing project and inputting the Arabidopsis genome into GUS is discussed. XMLGUS is available from the Flora website (http://flora.ittc.ku.edu/).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Appl Bioinformatics 2005; 4 (4): 227-246

1 Department of Computer Science, University of Pittsburgh, Pittsburgh, Pennsylvania, USA 2 University of Pittsburgh Cancer Institute, University of Pittsburgh, Pittsburgh, Pennsylvania, USA 3 Clinical Proteomics Facility, University of Pittsburgh, Pittsburgh, Pennsylvania, USA 4 Department of Pathology, University of Pittsburgh, Pittsburgh, Pennsylvania, USA 5 Department of Surgery, University...

متن کامل

Combining signals from spotted cDNA microarrays 1 obtained at different scanning intensities

3 H. P. Piepho* 4 5 Bioinformatics Unit, Institute for Crop Production and Grassland Research, University 6 of Hohenheim, Fruwirthstr. 23, 70599 Stuttgart, Germany 7 8 B. Keller 9 10 Bioinformatics Unit, Institute for Crop Production and Grassland Research, University 11 of Hohenheim, Fruwirthstr. 23, 70599 Stuttgart, Germany 12 13 N. Hoecker 14 15 Center for Plant Molecular Biology, Department...

متن کامل

Bioinformatics and Pharmacogenomics in Drug Discovery and Development - A Socio-Economic Perspective

4 Chapter 1: Introduction 6 Chapter 2: Background and Significance 10 Chapter 3: Problem Statement and Purpose 20 Chapter 4: Methods and Design 24 Chapter 5: Results 31 Chapter 6: Discussion and Future Work 56 Acknowledgements 65 References 66

متن کامل

Information Request

Program(s) of Interest Accounting [1] Animal & Biosciences [2] Applied Nutrition [3] Art History & Visual Culture [4] Bioinformatics [5] Biomedical Sciences [6] Biophysics [7] Biotechnology [8] Business Administration [9] Capacity Development & Extension [10] Chemistry [11] Clinical Studies [12] Computational Science [13] Computer Science [13] Creative Writing [14] Criminology & Criminal Justic...

متن کامل

Information Request

Program(s) of Interest Accounting [1] Animal & Biosciences [2] Applied Nutrition [3] Art History & Visual Culture [4] Bioinformatics [5] Biomedical Sciences [6] Biophysics [7] Biotechnology [8] Business Administration [9] Capacity Development & Extension [10] Chemistry [11] Clinical Studies [12] Computational Science [13] Computer Science [13] Creative Writing [14] Criminology & Criminal Justic...

متن کامل

Information Request

Program(s) of Interest Accounting [1] Animal & Biosciences [2] Applied Nutrition [3] Art History & Visual Culture [4] Bioinformatics [5] Biomedical Sciences [6] Biophysics [7] Biotechnology [8] Business Administration [9] Capacity Development & Extension [10] Chemistry [11] Clinical Studies [12] Computational Science [13] Computer Science [13] Creative Writing [14] Criminology & Criminal Justic...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005